33 research outputs found
Recommended from our members
Reusing Ontologies to Enrich Semantically User Content in Web2.0: A Case Study on Folksonomies
Semantic Web and Web2.0 emerged during the past decade promising to achieve new frontiers for the Web. On the one hand, the Semantic Web is an interlinked web of data, supported by ontological semantics and allowing for intelligent applications such as semantic search and integration of heterogeneous content across systems and applications. On the other hand, Web2.0 represents the new technologies and paradigms that revolutionised the user engagement in content creation and introduced novel means towards social interaction. Bridging the gap between Web2.0 and the Semantic Web has been proposed as a means to better manage and interact with the large amounts of user contributed content, which is a new challenge for Web2.0. This thesis focuses on a popular paradigm of Web2.0, folksonomies. In particular, we investigate the semantic enrichment of folksonomy tagspaces by reusing ontologies available in the Semantic Web. We identify the need for methods that automatically apply semantic descriptions to user generated content without requiring user intervention or alteration of the current tagging paradigm. We use an iterative approach in order to identify the characteristics of folksonomies and the attributes of knowledge sources that influence the semantic enrichment of tagspaces. We build on the results of our experimental studies to implement a folksonomy enrichment algorithm, that given an input tagspace, automatically creates a semantic structure that describes the meaning and relations of tags. We introduce measures for the evaluation of enriched tagspaces and finally, we propose a search algorithm that exploits the semantic structures to improve folksonomy search
Bridging the gap between folksonomies and the semantic web: an experience report
Abstract. While folksonomies allow tagging of similar resources with a variety of tags, their content retrieval mechanisms are severely hampered by being agnostic to the relations that exist between these tags. To overcome this limitation, several methods have been proposed to find groups of implicitly inter-related tags. We believe that content retrieval can be further improved by making the relations between tags explicit. In this paper we propose the semantic enrichment of folksonomy tags with explicit relations by harvesting the Semantic Web, i.e., dynamically selecting and combining relevant bits of knowledge from online ontologies. Our experimental results show that, while semantic enrichment needs to be aware of the particular characteristics of folksonomies and the Semantic Web, it is beneficial for both.
Semantically enriching folksonomies with FLOR
While the increasing popularity of folksonomies has lead to a vast quantity of tagged data, resource retrieval in these systems is limited by them being agnostic to the meaning (i.e., semantics) of tags. Our goal is to automatically enrich folksonomy tags (and implicitly the related resources) with formal semantics by associating them to relevant concepts defined in online ontologies. We introduce FLOR, a mechanism for automatic folksonomy enrichment by combining knowledge from WordNet and online ontologies.We experimentally tested FLOR on tag sets drawn
from 226 Flickr photos and obtained a precision value of 93% and an approximate recall of 49%
Anticipating discussion activity on community forums
Attention economics is a vital component of the Social Web, where the sheer magnitude and rate at which social data is published forces web users to decide on what content to focus their attention on. By predicting popular posts on the Social Web, that contain lengthy discussions and debates, analysts can focus their attention more effectively on content that is deemed more influential. In this paper we present a two-step approach to anticipate discussions in community forums by a) identifying seed posts - i.e., posts that generate discussions, and b) predicting the length of these discussions. We explore the effectiveness of a range of features in anticipating discussions such as user and content features, and present focus features that capture the topical concentration of a user. For identifying seed posts we show that content features are better predictors than user features, while achieving an F1 value of 0.792 when using all features. For predicting discussion activity we find a positive correlation between the focus of the user and discussion volumes, and achieve an nDCG@1 value of 0.89 when predicting using user features
Improving search in folksonomies: a task based comparison of WordNet and ontologies
Search in folksonomies is hampered by the fact that the meaning of tags and their relations are not made explicit in the system. This is typically addressed by using knowledge sources (KS) to semantically enrich tagspaces, most notably WordNet and (online) ontologies. However, there is no insight of how the different characteristics of these KS contribute to search improvement in folksonomies. In this work we compare these two KS in the context of folksonomy search. We show that while WordNet leads to richer tag structures than online ontologies do, its fine-grained sense hierarchy renders these structures less effective in search compared to the ones generated from ontologies
Recommended from our members
What can be done with the Semantic Web? An overview of Watson-based applications
Thanks to the huge efforts deployed in the community for creating, building and generating semantic information for the Semantic Web, large amounts of machine processable knowledge are now openly available. Watson is an infrastructure component for the Semantic Web, a gateway that provides the necessary functions to support applications in using the Semantic Web. In this paper, we describe a number of applications relying on Watson, with the purpose of demonstrating what can be achieved with the Semantic Web nowadays and what sort of new, smart and useful features can be derived from the exploitation of this large, distributed and heterogeneous base of semantic information